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related data) of incur^ants on a sMiple of jobs in terms^ of the nine tests 
qf the General Aptitude ' Test Battery (GAtfB) of the Unitea States Training 
and Employment Service. ' 

Since the GATB tests are not available for general use, the present . 
study was directed toward the use of the PAQ as the basis for predicting 
test-^related data for various coimnercially-available tests whioh' were con-^ 
sidered to measure the same ^'constructs" as those Masured by the GATB 
tests. Data were obtained for a sample of 96 jobs, the data consisting of a 
PAQ analysis for each job and data for the incumbents on various cornmercially 
available tests. Depending on the nati^ra of the test data available for 
individual jobs, four types of criteria' tore used as indications of the 
*'importapce" of Individual tests for each , job in .question. Thes^ criteria 
were^i (1) the mean test ^core of incui^ents on the jobi (2) a "potential", 
cut-off score (which was the score which had actually been used as a "cut-- 
off" for selection puxposas by the organisation which supplied the data); 
(3) a validity doefficienti and (4) =an indication as totwhether the test 
would be 'Valid^^ for the job. ^ The data for the various jobs for which any 
given test had been used ware ^Suped into categories for the ^ "constructs'' 
represented by the individual GA'i'B tests. (Adequate data were available for 
only five of , the nine constructs^ The available "norms'' of the. tests re- 
presenting ariy given construct were converted to a set of standard scores 
with a mean of 100 and a standard deviation of 20 (that used with the GATB 
tests). The PAQ job dimension- scorei were then used as predictors of what- 
ever test-related criteria were available for.any given construct. 

\. The PAQ-based predictions- for the criteria of mean test scores and cut^ 
of f scorai: ware all highly significant. However, the predictions for actual 
validity coefficients were understsmdably low. The prediction of whether or 
not specific tests would be ^"valid" ware also significant (in 75 percent of ^ 
the cases for which the PAQ predicted a test would be valid, the tests 
indeed proved to be valid predictors) * ' - 

Although the predictions supported the utility of the PAQ-bassd job- 
component validity model, a number of problems probably resulted in overly 
conservative predictions, '^his particular report is of a preliminary 
nature, based on the data' available at the. time. A later analysis will be 
carried out within a few months after the sample sis.e is increased. 
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INTRODUCTION 



The conventional method for identifying personnel tests =to be , 
-1 -J iplection of personnel for various jobs consists of 
"^f'fl^^inf of telts SofeSch particular job in question. This pro- ^ 
Sdirf involves! Ilf thf administratis of a sample of tests to 

^tuay. Thus, °verjh3 years ^there have^been certj^ ^.^ 
-to develop some 'VP^ °« ,f S procedure that would be es- , 

-„^jL?rris?:rLihrs5s 

test r.te?rel='^of f IS n"^ t o ?^ U 

j°L??rtrb»^ 

described by Balina (1959, p. 353) as rox4.ow_ -nalvsis of ioba into 
validity in a specifio. situation from a ^^^^^^^ ^m^^nts , 
their elements, a determxnation of test validity ^ „ 

and a combination of elemental ^^^^f ^^f^.^v^^/ f specif i- 
the term synthetic validity has been ,f ^74! ^hls suggested 

cally appropriate to such procedure, McCormick (ly/^; na yy 

"r.e°L^SLr.ent^:ra°p?cSeSLr^fo? ^stabXishin. -eVjob co.gon- 

eat validity o£^,prediotors for J-r^LStirinf tr^S?- 

' {McCormicK, et.al.^ ^I'V : L^h ^re referred to as job elements by 
1±^T 'iirT:ll,o6 ^fordiSr:5h"?/£or"an experimintal sample of 



jobs, the human attribute (s) required for successfur job performance 
v/hen *a given job component is common to several jobs; and (3) 
some method of combining the estimates of human attributes required 
for individual job components into an overall estimate of the human 

' attribute requirements for an entire job. . Such a procedure would 
make it possible to "build-up"' the aptitude requirements for any 
given job by i (1) knowing what job component s occur in the job in 
question; (2) knowing what aptitudes are required for each such 
component ; and (3) knowing what aptitudes are required for each 
such component; and (4) - having a procedure for summating the at^ 
tribute requirements that are relevant to the individual job com-- 

' ponents. 

The Position Rnalysis Questionnaire ( PAQ) 

Various procedures have been used in the development of some 
type of job component or generalized validity procedure. One of 
these has involved the use of the Position Analysis Questldnnaire , 
(PAQ) . The PAQ is a structured job analysis questionnaire that pro- 
vides for the analysis of individual jobs in terms of 187 job ele- 
ments. In the analysis of jobs with the PAQ job elemfsnts, various--" 
rating scales are used (the particular rating scale used £or each 
job element being the one for which" the concept of the scale seems 
particularly appropriate to the element). Most of the scales are 
six-po^int Likert-type scales, ranging from zero (does not apply) to 
five (the. highest valu^;) . The various Scales used include those 
dealing with importance, time, extent of use, and in some instances 
special scales. In certain instances a dichotoffious scale is used. 
The dichotomous scale provides , for indicating whether the job ele- 
ment in question does, or does .not, apply to the job. 



Principal components analyses of the PAQ.' Data based on the 
• PAQ havb been s^ubjected to various principal components analyses. 
Of particular relevance to our present interest are those carried 
out by Jeannert and McCormick (June 1969) , and by Marquardt and , 

' McCormick (1974a). The first of these was based on a sample of 536 
jobs, and resulted in the identification of 32 principal comporients 
which are referred to as job dimensions. 27 of theee .wefe based on 
the principal components analyses, of the job elements within each of 
the six divisions of the PAQ, and the other five were based on the 
principal components analysis of most of the job elenients of the PAQ 
pooled together. The study' by Marquardt and McCormick was based on 
a sample of 3700 jobs, and resulted in the, Identification of 30 job 
dimensions .resulting from the principal components analyses of the 

■ job elements in each of the six PAQ divisions, and 14 based on an 
overali or 'genera"'. principal components analysis using the pooled 

* elements from all six of thfe PAQ division. 
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Use of the PAQ to establish job component validity . One of 
the primary uses^ of the PAQ has been in the framework of estab- 
lishing the job component validity of tests for various jobs. 
This has consisted primarily of the analysis of samples of jobs 
for which test data for the job incumbents were' available from the 
United States Training and Employment Service (USTES) , and for 
which PAQ analyses were available. The USTES publishes test data 
for incumbents on several hundred jobs^ the test data consisting 
of normative and validity data for the nine tests of the General 
Aptitude Test Battery (GATB)p These tests are as follows s 

G - Intelligence 
V - Verbal Aptitude 

N - Niimerical Aptitude ' 

S - Spatial Aptitude ^ , 

P ^ Form Perception 

Q - Clerical Perception 

K " Motor Coordination 

F - Finger Dexterity - ^ 

M - Manual De ' xty 

As the primary approach to the use of PAQ-^based data in-^the job 
component validity framework^ samples of jobs were selected which 
"matched" the ones for which the USTES has published test normative 
and/or validity data. In these analyses, the primary criterion of 
the "importanae" of a given test to any gUven job consisted of the ' 
mean GATE test scores of the Incumbents on, the individual jobs. 
This criterion was based on the assiimption that individuals tend to 
"gravitate" into jobs 'which are commensurate with their own abili- 
ties. Thus, it would be assumed that jobs for which the incumbents 
have high mean test scores on a given test would reqiiire mora of the 
quality measured by the test than jobs for which the incumbents have 
lower mean test scores/ Using mean test scores as a criterion^ the 
scores on the PAQ job dimensions were then used in a regression pro-- 
.cedure for the prediction of the mean GATE test scores. 

Two such studies have been carried out. The first of these ^ 
by^Mecham and McCormick (1969)^, involved a sample of PAQ analyses 
for 179 positions which "matched" 90 jobs for which the USTES pub- - 
lished test data. In this instance the PAQ job dimensions that were 
used as predictors were those developed by Jeanneret and McCormick 
(1969), In the second study PAQ analyses for 659 positions were 
matched with 141 jobs for which ^the USTES had published test data 
(Marquardt and* McCormick {1974b), In th^ case of both of these 
studies ^the prediction of the mean test scores of the incumbents 
from PAQ job dimension scores was quite respectable. The ranges and 
medians of the multiple correlations across the nine GATB tests re- 
sulting from these studies are given below. 



Mecham and McCormick 



Marquardt and McCormick 



Corrplations 



Range 
Median 



.59 to ,80 
. ^71 



,46 to 

,73 



.76 



In both of. these studies the predictions of the cognitive 
tests .were best, those of the perceptual tests were intermediate, 
and those of the psychomotor tests were the lowest. Although 
there were differences in the predictions for the various types 
of tpsts, the general level of prediction was viewed as demon- 
strating the potential utility of the use of such a procedure for 
the establishment of the jcb component validity of personnel se- 
lection tests. 

Objectives of the Present Study 

The use of PAQ job dimension " scores for the prediction of 
mean GATE test scores of "hypothetical" samples of job incumbents 
on various jobs clearly can give some indications of the apti-. 
tudes that- presumably would be required for individual jobs. How- 
ever, since the predictions are in terms of GATB test scores, and 
since the 'GATB tests are not available, for use by private organi- 
zations, l^he operational use of such predictiohs would necessitate ^ 
that the predictions in terms of the nine GATE tests' would' have to 
be "converted" into terms corresponding to those of commercially- 
available tests. Thus, it would be desirable to develop some 
procedure- for use of the GATE test score predictions as the basis 
for the sf^lection of corresponding commercially-available, tests, 
and for the estimation of scores for such tests which correspond 
to those of the GATB tests. Thus one could use predictions of 
appropriate GATB test score cut-offs as the basis for .deriving • 
estimates of cut-ff scores on other {corresponding)-tests which 
v/ould be comparable to those of the GATB test in question. The 
basic objective of the present study has been that of developing 
^ome procedure for shifting from the prediction of GATB test scores 
to the prediction of scores on commercially-available tests that 
presumably correspond with those of the several GATB tests. 

There are two possible general approaches to the matching 
of GATE and commercially-available tests that might serve as the 
bksis for converting from one to another. The preferable approach 
-would be one for which data are available for two tests that -are 
based on the scores of the individuals in a "general population 
who have taken both tests. The equivalence of two such tests • 
would best be reflected by a high' correlation between the two . > 
in turn, corresponding norms for the two tests preferably should be 
available for the general population, in order to make.it possible 
to "co'nvert" scores from one test to equivalent scores on another 
test in terms of either standard deviation units . from the. mean, or., 
in terms of percentile norms. The USTES has published data on 
certain commercial tests that have been administered to the same 
samples of individuals who have taken certain of the GATB tests. 
These data, however, were found not to be particularly useful for 
Sis study since msny of the. tests for which such data were pre- 
sented were those more typically used in educational circumstances 
rlthlr than for personnel selection. ^ Also many of the samples of 
inSvIduals represented in the normative data f^^'^ff^,,^^^^ 
or of individuals on given occupations ^rather than of the general 
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population." . In most instances normative data were simply not 
available. 

The second approach is one in whiGh a judgment needs to be 
made about the equivalence of test content, or the equivalehce of 
the "GonatruQt" that presumably is being measured, by the two tests 
in question/ This is admittedly a subjective evaluation ^ and there- 
fore needs to be approached with caution. In the. case of some pairs 
of tests there is no paxticular problem in making a reasonably valid 
judgment about their equivalence ^ but in the case of other tests 
the subjective judgment may not be entirely valid. In those in- 
stances w)^ere tests are considered to be equivalent^ there is of 
course thte further possible problem of conversion of norms from one 
test to "^hose for the second test, .Many of the norms presented in 
test mantials are for individuals on ^certain' jobs or in various job 
grouping|b, without there beiilg norms available for what might be 
viewed as a "general" population, - (It might be added that the norms 
for the GATB test have been based on a sample of 4000 individuals 
whose jqbs ' are reasonably representative of the major occupational 
groups of 'workers in the labor market, ^ Therefore, the application 
of this approach preferably would require the availability ,of a 
reasonably comparable set of norms for any other test that would be 
considered as being essentially "equivalent" to. one of the GATE ^ 
tests,) ' .. 



METHOD 

The primary focus of this study, then, was to develop some way 
of testing ^ the utility of a job^component validity models based on 
PAQ job dimensions, for use with commercially-available tests. To 
accomplish this it would be necessary. to translate the predictions 
for the GATE tests made by combining .the PAQ job dimension scores 
into terms relevant to commercially-'available tests representing^ 
similar aptitude" constructs , Theretore, this study was viewed as 
a test of the generalizability of the PAQ-based job-component valid^ 
ity model that has been heretofore tested only with GATE test data, 
.The basic approach used has been that of obtaining,- from vari^ 
ous organizations, data from any validity studies that they had 
carried out for jobs in their organizations, as well as obtaining 
PAQ analyses for any such jobs. Several approaches were used in 
an attempt to obtain such data^ including the following: 

'1. Direct mailing of letters to several hundred organizations, 
explaining the goals .of the project and asking them to 
submit any relevant test data they had available (validity 
information as well as normative data) , and asking them to 
' ' arrange for the analysis of jobs in question with the PAQ, 

2* Estrnbiishing contacts with various test publishing firms , 
= asking them if they had any validity or normative data on 

^'any of the tests which they published for incumbents on 
specific jobs. 

■ ' ' ■ 11', . 



3. Mailings to consulting firms which it was believed 
were involved in test validation studies,, 

4. Mailings to former graduate students of Purdue Univef'- 
sity. 

5. Appeals made in certain publications which it was felt 
had the audiences that it might be useful to c^jact, - 
asking if anyone had the kinds of data we were seeking 
These'^appeals included articles in The Industrial Psy - 
chologist (TIP) (the newsletter of the Division of 
Industrial/Organizational Psychology of the M\erican 
Psychological Association) , and the Personnel Adminis - 
trator , which is the official organ of the American 
Society for PersonneT Administration* 

Although the combination of all of these sources yielded 
test data for incumbents for a moderate number of jobs, there was 
still a problem in the ca^^e of certain jobs. In the case of cer- 
tain jobs for which test aata were available the organization or 
individual furnishing the test data were unable or unwilling to ar- 
range for the analysis of those jobs with the PAQ, In order to 
include these jobs in our sample, it was decided to "match" these 
jobs with jobs which had already been analyzed with the PAQ and 
which v/ere in the PAQ data bank (presently consisting of some 20,000 
jobs). This matching was carried out on the basis of job code 
numbers from the Dictionary of Occupational Titles (D.O,T.)/ which, 
although admittedly imperfect, seemed to provide a reasonable basis 
for matching jobs. Even after augmenting the sample in this way/ 
however,^ the total sample still consisted of only some 58 jobs* This 
v/as not seen as a large enough sample to allow for any meaningful 
analyses. Thus, other attemps to obtain relevant normative and/or 

validity data were made. u- i 

In an attempt to enlarge .the sample of jobs, some archival _ 
data were used. Such data consisted mainly of validity or normative 
data for various tests for incumbents on different jobs as reported 
in sources such as the Validity Information EKchange of Personnel 
Psyc hology , The Handbook of Employee Selection (Dorcus and Jones, 
1950) . and the manuals for various tests commonly used ^in industry. 
The PAQ analyses for the jobs for which data were, obtained through 
these sources had to be obtained by matching these jobs v/ith ^obB 
in the PAQ data bank, hs before, this was done on the -basis of ; 
D.O.T, code numbers. inis archival data yielded data on an addi- ^ 
tional 38 jobs, bringing the total sample for this study to96 jobs. 

^It should be noted that obtaining test data and PAQ analyses for 
jobs so they could be included in this study was the major diffi- 
culty encountered in the project. The present sample took almost 2 
years to collect, and new approaches to data collection are cur 
rently going on. It .was felt that a preliminary report should at 
least be prepared to describa this line of research, but it is hoped 
that present efforts will allow a more extensive analysis later with 
a much larger sample. 
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With this final sample of 96 jobs for which there were avail= 
able test data from either an orgahizati'on , or various archival 
sources^ as well as a PAQ ahalysiV for each job, the following 
.operations were carried, out ^ //^ ' , --.^ 

1. JDhe commercially^avai/iable tests for which data were now 
available, ^ had to be /matched to individual QATB tests 
which were judged to measure, the same "constructs." 

2^ A method had^to be developed which would allow the 

equating of the norms for the cbrnmercially^available 
1^ tests with the norms of ^the corresponding GATB tests* 

3. Analyses ^, would be "carried out relating, for each job, 

the PAQ predicted GATB test data, to the data bn the com= 
mercially-available test in question, ^ This comparison 
would- revolve aroundi (1) the mean test scopes; ^ (2) 
what were referred to as "cut-off "scores ; (whi^ch in most, 
instances consisted lOf the test score one/ standard de-^, 
viation below the mean of the scores of the Incumbents 
■ ' on the job in question); p) valicaity coefficients;, 
and (4) the determinatidn of whether or not. the -test ' 
. would be a "valid" predictor of performance fnr the job 
r: in question. 

Although the. total sample cunsisted of 96 jobs, a small 
portion of the sampie was based on data for jobs in each of 
seven clusters which had been formed by one organization*. 
(These clusters had^ been used in a previous test validation 
study the prganization had carried out*) PAQ analyses were 
available lor certain of the jobs within each cluster,, but ' ' 
not for all jobs. The test data for. the incumbents of all of 
the jobs within each' cluster were not dif f erentiarted by specif 
Ic job* Thus, it was not possible to relate PAQ analyses for 
the individual jobs with the test data for the incumbents on 
those. same identical jobs, In view of this a "composite" 
PAQ analyses were then used as the "predictors" of the test 
dati for the incumbents on the jobs in the clusters* 

In view of the special treatment of the .data for the jobs 
in these seven clusters a complete, set of analyses was also 
carried out for all of the jobs excluding these* This set of - 
analyses then was based on a subsample "^bf 89 jobs (96 minus 
the 7) , ..^ r, " . , 
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Constructs Us^d in the Study '■ ■ ' 

: AS was mentioned above. ■ this study "^^i^f 
a number of commercially-available tests to the GATE tests. This 
presented a bit of a problem in that the.factor.al compositjon^of 
= r^r.a■^■^n^-^ V " s iml lar " tss t s IS sometimes quite different, and tnus 
?nests'of! £or"Kample, verbal ability, may actualjy^be tes.ing^ 
somewhat different. abilities or attributes. ^r°5nizing |his prob 
?Sm -it was nonetheless decided that the only feasible way that 
Ss stud? cSull be carried: out would to consider as ^^^^ 
Ih^ qame "construct," all of the commercially-available tests that 
putport to measure the same cdnetruct- as that represented jny ^ 
giJIn GATE test. Although the actual GATE tests were available, 
tSere Is information-'published concerning the g^peral nature of 
each subtest. Thus, fo^ exaiople, a test of "^"bal aptitude that 
consisted primarily of reading comprehension items would not ae 
incfidefas a test oflthe "Verbal Aptitude" construct as nieasured 
by theverbal (V) GATB test used in this study, to represent^ that 

^""^""lince the entire framework of this study revolved; around the 
GATE it vms only natural that the constructs which be of 

Interest would bS the nine measured by the subtests of ^the GATE , 
thele Icing General Intelligence (G),, Verbal Aptitude (V , Numeri-^ 

An?itude (m,^=patial Aotitude (S) , Form Perception (P) , Cleri 
III perception Q) - Motor Coordination (K) , Mngar Dexterity (FK ,., 
Snd .Mn.mal DeKterii-v (m : The commerce aUy-available tests used 
to measure each' construct , and the number-, of jobs for which each 
'such test was used are given in^ Appendix B. 

De velopment of Equiva lent Norms 

The determination of which commercially-available tests 
measured each of the constructs in question, although basically^ 
a judgemental question,- did not pose any serious prpblems. Oncj „ . 
•?he individual tests had been classified as measuring a specific 
construct': it was then .necessary to develop a method of equating 
Scores on each of the commercially-ayailable tests used to measure 
the construct to test scdres on the GATE subtests for that ^^me^^ 
const?"" The optimal procedure for accomplishing this would have 
construci^^ available normative test data for a single, general 



been to have avaiiatoJLe normative te&u i^aua ^.w^ ^ J 
popCla?i2n on all the tests: within a^P^^J J^f 
data were not available, however, and other methods had to be em 
Ployed ThSse methods involved the combining and synthesizing of 
gendi n^rm groups, and eventually all test scpres -f|^f|-^-J. 
in the same standard score units, those units based on the stanflarc 
score dJSJibution renewed for the GATE tests.-^ The GATB standard 
scores are based on a mln of 100 and a standard deviation of 20. 

morl detailed explanation of the methods employed to develop 
\hese. equivalent -norms /is given m Appendix C. _ _ i„,, = +.e 

" AS a result, for;' any construct, it was possible to locate, 
for each job for which test data for a .test measuring that con 
S^uct were available, the position on' the continuum of scores on 
Ihrcon^truct where /the snmpic of incun^bents on that n.ob would 
fall It was the scores of thene incumbents on the different con- 

^ . .' 14 ■ 



fitructs that WG^G us- 1 as criterion values in this study , such^ 
criterion value>' ' i'or v arious jobs being viewed as reflecting the 
relative "importance" to the jobs in question of the construct m 
question, • ' 

A ctual Criteria Used . , , - ' 

Four different criteria as related to individual jobs were . 
used in the study,, these being considered as reflecting various .in- 
dices of .the "importance" to the individual/jobs of each of -the 
nine , constructs as represented by. the GATB tests. These criteria 
for- each job and ia each test, consisted .of i/ . (1) mean test score 
of the job incumbents on the individual job; (2) a potential^ 
cut-off score; ^3) a coefficient validity; and (4> an indi- 
.cation of wHethA^the test would be ''valid" for the job. The first 
two of these were 'considered to be the primary criterda used in ; 
the study. Considereing Vfor a nibment the^criterion of mean test 
scores of incumbents on the individual jobs; one could view. a con- 
tinuum for each of the nine constructs expressed dn standard score 
forrn with the mean scores of incumbents on the various jobs fall- 
ing in various positions alona that runtinuum, f rom low to high, 
(As indicated aarlier,^ i:ho conversion of the n rms of tho 
commercialdy-availabiG tests to the standard score ^ forms of the ^ 
GA?b Ss?s served as .ho conimori metric for relating the mean scores 
6f incumbents on that continuum.) In tha^ case^of ^^^f ^obs the. 
cut-oEf score criterion consisted of scores that were actually 



m^--otr score critierxuii ^^wiisj-s ^-^^ ^^^^^^ - ± 

used as cS?-of£ scores in selecting people for the jobs in question 
bv the orqanizations which had provided the test data-. In the 
cLe of rtlst jobs, however, "this was a ■■potential" cut-off score- , 
which was one standard deviation below the mean of the scores of 

incumbents on the job in question. . .. ^-i «n+-R it was 

In the case of the criterion of validity coefficients it was 
of course not necessary to be concerned about the normative^ ^ 
Sataihat were used with the criteria of mean test scores and pre= 
dieted cut-off scores. Rather, for any given construct, the co- 
efficients of validity of the tests which were considered to repre- 
sent tha^tconst^Sct could be viewed as representing a continuum 
?rSm low tS high as expressed by the actual coeff icient , values them- 
Se?vei ^n thl case of certain analyses' the fourth criterian was^ 
used namely an indication as to whether individual tests would be 
^"ik" p?edic?ors of performance. This criterion, was based on the 
qulstion'^as to whether the initial coefficient of validity for that 
test itself was one which was statistically significant or not. 

Development' PT-r^dinted Criterion Values 

Thp 'predicted criterion values for the individual 'jobs were 
deriveS^fJom a standard computer printout of data that are printed 
from Ihe PAO analysis of any given job/ ^l^ese computer printouts 
are based on previous analyses of PAQ-based data I^^^^Jj^f • ^^.^^J, 



for 
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-.. ' ^■ 

the mean test score* of job incuinb«nts ^ the standard deviation of . 
the distribution of test scores of incumbents on each- job (to be 
used to establish a "cut-off" score) ^ and the validity coefficient * 
The computer printout/ based on regression analyses. of PAQ job 
dimension scores as related to these three , values , provides esti** 
mateSr.for any given job^ of the first three criteria. In con'- ' s 
nection with the. fourth criterion the computer program on which 
these computer printouts are based) also include^ provision fo^ 
making a prediction about those tests (usually three) which would 
be ^ valid predictors for performance on Jthe jpb in question* This 
particular aspect of ^,the program^ in effect^ is a "policy captur^ 
ing" procedure that parallels the practice of the USTES in its 
approach to the identification of the thf^e. "best" or most "valid" 
tests for use i if the seicrCtion of, individ\ials f or any given job, 
and in establishing cut-off scores for those three-^ tests The PAQ 
printout, in effect, provides estimates gf the cut-off scores of 
those three tests* Thus, for any given job, the ^'fourth criterion 
■cc^ri^ists of the identificatibn of three tests , 'which . are predicted 
to be the most "valid" for .use, in selecting. people for any given 
iob, bascKl on URTFF nractibes in this area. Thus, in the^caF-p nf 
the three tests identified by the computer program as bf ing most 
"valid" for any given construct, a determination would be made as 
to whether, that test in the actual validity setting, didv in fact 
turn out to^^have a significant validity coefficient for the job 
inv question, ^ . 



. Relating Predicted to Actual Criterion Values . 

^ As implied above, the predictions of -the fourV different 

criteria used in the study were derived from tlie conventional 
PhQ computer printout for the individual jobs used in the case of 
the analysis of any given construct. In these predictions there 
was of course the initial "selection", of jobs for which relevant 
test data were available for the incumbents as related to . any 
given construct. In addition, given, those jobs for which test J 
data for a given construct ' were available, th^re was a further -V^' 
selection, for individual analyses, of the types of test criterion 
data which were actually available. Thus, for any given construct, 
a job would be included in the analysis for any particular cri- 
terion, depending upon whether actual criteria data were available 
such as mean test scores, cut^-off "scores, validity coef f icientp , 
or an indication as to whether a coefficient was or was not valid,. 
Thus, the analyses consisted of a: series of sub--analyses for the 
'individual constructs, each sub-analysis consisting of data for ■ 
jobs for which both predicted and actual criterion data were avail' 
able. . : ' . 

In this process the actual (or obtained) test values for in- 
cumbents on any job, were converted to standard scores on the 
constructs involved. The predicted scores were all in terms of ^ 
GATB tests, but, since the constructs were defined in terms of the 
GATE subtests, andvsincc all scores 'were in a common metric, this 
provided no problem. The cibtual predicted scores themselves were . 



the raault of^ombining the FAQ job dinien'aibn\ scores for any jot, 
according to r&qre^sion equations/ developed dn earlier research 
designed to prediSl: GATE test scores; froni PhQ job dimension scores 
.{Mecham a^d> kcCormick, 1969)* These aquations were designed to 
yield niean test scores # potent^-al cut'-off scores r validity coeffi- 
cients/ as well as make some predictioh about which tests should 
be valid predictors of performance for a job. These predictions 
were then compared to the actual criterion data in these areas by 
means of* a series of correlational analyses, / 
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RESULTS 



For each of the nine- aptitude constructs ^represehted by the , 
nine tests of the General Aptitude ^ "Test Battery (GATB)^ Pearson 
produat-moment correlations ^ were compttted for each of the following 
sets of data, the analyses in each instance being btsed on those 
jobs for whiQb relevant test data and criterion data were available * 

(1) Predicted GATB mean test scores ,(as derivedf by procedures 
involving job analyiis data from the PAQ) and actual me^n. 
test scores obtained for incumbents on each of the joba 

in .the sample |. ^ ^ 

(2) Predicted GATE. cut-off scores (scores one standard de- 
viation below the prtedicted^^mean test scores) and (in 
the case of a few organisations) actual cut-off scores 
which had^ been set for each of^^the jobs in the sample ./ 
{the actual cut-off scores we^ hot necesBarily one stan--' 
dard deviation below the mean) ^ ' . ■ ' ^ 

(3) ^ Predicted validity coefficients ^' (dbtained from PAQ job 
' , . ^ analysis procedures) ^ ' r / ' 

iK addition to the Pearson product-mpment correlatlpns , a 
phi aoefficient was computed for each of the nine GATB aptitude 
constructs between t^e predicted validity df a particular^test 
(valid=l, not valid =0) as derived from PAQ procedures, and the 
actual validity of the tests (valid^l , ^not| valid^O), as obtained, in 
actual validation procfdures carried out by organizationB providing 
data for the present- study * The Rear son product^motnent corrclatins- 
as well as the phi coefficients computed fpt.each of the nine GATB ■ 
aptitude constructs are presented in Table' 1, 

One; should note that no. .test data were available f or ;^ either 
the Form Perception or Motor Coordination iconstructs In the cape 
of Finger Dexterity and Manual DeKterity, data were available on^ only 
seven jobs, th^se being the seven job clusters mentioned earl^ier iri 
this section. Data concerning these two ability areas probably / 
should be considered as 'essentially meaningless , ^becapse of tiie ^all^ 

sample size, . , , ^ , *^^4>,^+.« . 

The results reported in Table 1 would seem to indicate. a 
substantial relations^liP between PAQ predictions concerning mean, 

^^^iJofll^and cut^off^^.5Cores obtained for theoobs m the sample . 

^"lonsfdering; the relktionship between ^^^^^^f^f^^^ ^ 
test scores for those, ability areas with sufficiently large sample 
s?^ls trSarrlnt:consider^^ five of €ive correlations are signi^ 

f leant at the 03 level or better. The correlations range from 
: 3? Uatin! AbL to^.68 . (Clerical Ability ).- The results. co^ 

cerning predicted and actual cut-off scores are similar .Again, ^ 
When Slnsiaering cr,ly those .biUtyraraas ""J. f ^^uate^eampla =.,.as 
four-of four correlations are significant at the ,03 level or Detrer . 
Thf corrections range from;.28^: Xlntelligencel ^o ,70 (V 
Aptitude) * " ^ ' , - ^ 
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The relationship between predicted, and -jctual validity CO- 
, efficients is not as promising r however Only ona of the five cor-- 
..relations' is significant at the .05 lev^al , with it' being, in the 
opposite direction to what would he normally eKpected, The correla^ 
t ions range from -^*36 (Verbal Aptitude) to .19 (Numerical Apti.-^ 
tude) * The data relating tQ "^the predicted versus adtual , validity or 
non-'yalidity of the .particular tests show. conelderably stronger re^ 
.lationships than for the validity coeffioienti themselves. These 
Coefficients ranged from^ 15 (Intelligence) to .53 (Spatial Ap- 
titude) . It would thus seem that PAQ-^based data were relatively 
successful in predicting whether or npt partiqular tests wouldv 
prove to be valid predictors of job performance, ,^ 

One explanation of the relatively (moderately) low phi co-^ 
efficients obtained in the present study is that PAQ-based data are 
conservative in their prediction of the validity or non-yalidity of 
tests. Predictions from PAQ data would thus have a. tendency to pre-^ 
diet as invalid a number of tests which jnight actually prove to 
be valid indicatorG of job performance. As a^result^ a frequency 
count was made of only those cases where the .PAQ data predicted that 
a particular test would be Valid. In Table 2 are given the number' 
of. correct valid predict ions and the number of predictions which 
were incorrect^ as well as the percent of predictions made which 
were correct* Note that considering all five' ability areas where 
such data were available^ 75 percent of the cases in^which tests 
were predicted to be validf they were indeed valid predictors of 
job performance. These result's^ taken together with the phi coef- 
ficient' data^ would seem to suggest that, if anything^ predictions 
based upon PAQ data are conservative in nature> and ' are relatively 
accurate in their prediction of valid indicators of job perform^ 
ance. 



' ■/ ^ TABLE I . . ' ■ 

Correlations Between Predicted and Actual 
Mean" Test ^cores. Cutoff- Scores, Validity 
Coefficients, and Phi Coefficients for 
Valid - Mon Valid Tests 



Pearson Products-Moment Cprrelatione i Criterion of Mean Test Scores 





Total Sample 




Subsample N=89 




Test^ 


r 


significance 


N 


r 


significance 


N 


G-Intelliganca 


.32 


.011 ^ ^ 


49 


.48 


.001 


42 


V- Verbal Aptitude 


.52 


.001 


34 


.48 j 


.aoi 


34 


N-'Numerical Aptitude 


.54 


.001 


69 


.56 ' 


.001 


62 


S-^Spatiaa Aptitude 


.30 


. 03 0 


39 


. 29 


■ .050 . 


32 


PrForm Perception 


* 


* 


*• 


'* 


* 


* 


Q-Clerical Fferception 


.68 


.001 


31 




.001 


31 


^'K^Motor Coordination 


* . 


* 


* 


* 






F-Finger Dexterity 


.02 


.480 , 


7 


* 






M-'Manual Dexterity - 


.67 


.050 


7 


* 




if 


Pearson Product^Moment Correlations i Criterion of 


Cutoff Scores 




■ Total Sample 




Subsample N=S9 




Test . ^ 


r 


significance 


1 


r 


significance 


1 


G--Intelligence . 


.28 


.02 8 • 


47 , 


'39 


- .006 


40. 


V^Verbal . Aptitude 


.70 




34 


.70 


,001- 


34 


N-NumericaJ Aptitude, 


.61 


■001 


45 


.64 


,001 


38 


S-Spatiai Aptitude 


.53 


.024;, > 


14 


.53 


.110 . ^ 


7 


P-^Forin Perception 


■' * , 




'- * 


* 


* 


* 


Q<'leria;alPerceptioh 




* 


* 


* 


* 


* 


K=Motor Coordination 


* 


i 




* 


* 


* 


F-Finger Dexterity - 


.20 


.329 


7 


* ' 


* 


* 


M-Manual Dexterity 


.23 


- .305 


7 • 


* 




* 



Insufficient number Of cases for analysis 
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= ^ TABLE 1 (Con' t, ) 



Pearson Product--Moment Correlations i Criterion of ^Mlidity Obef £ icien ts 

Subsample N^Sg 
k aiqni^f ioance . N- 





Total Sample 




Test 


r 


significance 

r - - 


N 




• 26 


.l9i'i ■ 


13 / 


V-Verbar Aptitude 


• 36 


.022 


31 


N--NumeriGal Aptitude^ 


,19 


.072 


56 


S-^Spatial .Aptitude 


,15 


.204 


29 


P-Form Perception 


* 


. *, 




Q-Clerlcal Perceptibn 


• 16 


.182 ' 




K-Motor Cobrdination 


* 


* 


UU 


F--Finger Dexterity 


* 


* 


UU 


M-^Manual Dexterity 


* 


* 




PYii Coefficients: Criterion of Valid ve 


1 Non^ 




Total Sample^ 




Test . ^ 


r 


significance 


w 


Intelligence 


,15 


.303 \ ' 


13 


V=-Verbal Aptitude 


,49 


;003 


31 


N^Mumerical Aptitude 


• 30 


.011 


56 


S-Spatial ' Aptitude 


,53 


.001 


29 


P^Form Perception 








O^Clerical Perception 


.01 


.485 


34 


K-^Motor Coordination 


* 


* 




F^Finger Dexterity 




* 




M-'Mahual Dexterity 


* 


* 





Same as 
for total 
sample 



SubBample N^89 
r significance 



Same as . 
for total 
s ample 



Insufficient number of cases tor analysis 
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, • : . TABLE 2 . 

Frequency Count of Correct and Incdrrect 
^ Predictions Only When PAQ-based Data Predict ^ 
' Tests to be Valid Indicators of Job Performance 





No* Correct ^ 


NO * mcorrec u 


£^ ^ W W A i 




Predictions 


Predictions 


Correct 








Predictions/ 










X E a U ■ , 




. * . . 


Correct 


G-lntelligence 


. 9 ' 


3 


7 b 


V--Verbal Aptitude 


7 


0 


100 


N-Numerical Aptitude 


15 




7^ 


S^Spatial Aptitude 


3 ; 


1 


68 " 


P=Porm Perception 


* 




* 


Q-Clerical Perception 


• . 13 


6 ^ . - 


_ 68 ^ 


K=Motbr Coordination 


ii 




* 


F-Finger Dexterity 


* 






M-Manual Dexterity 


* 


* 


* 


All Tests Together 


. AT - 


16 : 


75 



*InsuCficient number of cases for analysis 



i .- - . ■ 
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DISCUSSION 



It would seem, based on the results reported here, that PAQ- 
based data dan indeed serve as the basis of a job component jali- 
dity model as reflected by the fact that there ^=is J reasonable ^ 
relationship between, test score data predicted by the PAQ, and 
Actual test- score, data for job incumbents resulting from actual 
test validity studies. Although data were not f^^lable in all the 
areas tested by. the GATE, for those areas for which adequate datft 
w^e avSfJablef there were significant- relationships J^^" 
dieted and obtiiried mean test scores, and between predicted and ob- 
?aihSdlut=off scores. Furthermore, for the tests which were pre= 
dieted' to be valid by the PAQ, in only 25 percent: of the cases were 
the tests not actually reported to be valid'. These results are 
Nearly indicative , of the utility of PAQ^based data in a^^ob com^ 
ponent ■ validity model in and of themselves, and, when all 
the data are taken into consideration, the .support is rather i^- 
prlssive, especially considering certain obvious shortcomings m the 
available^data._^^ these possible shortcomings for example, ^t^e 
regression equations used to predict test data were not based on 
ISLercially-available tests such as were used in this study, but 
were based on GATE data. Although the feests used m this study were 
mltlhed with the corresponding GATE tests as well as possible, it is 

Obvious Satlall tests Classified as "P-^-J^i"! ..^LSucf "Ihuf 
are not necessarily measuring the same "identical construct. Th^s, 
wlhave a regression equation derived to predict scores on one test 
of lav verbal aptitude, and we are usuig that regression equation' 
lo'preducrtest data based on other tests of "verbal, aptitude" which 
mly^afllast be- s^ewhat different in content, ^he^magnitude of^ 
the obtained relationships becomes even more impressive in part be 
cause of the possible "slippage" from this possible disparity. _ 
-""'\hare arS^ther problems as well. ^^e process of equating the 
^^^^ -hhe rCiffRrent tests may also hava added some error van 

ancf iSo SrprSluon s?stem,'^although this probably did not play 
fmljS ?oL in reducing the P"di=tabillty of test^score^^^ ^ 
'mnfh more serlous problem probably Stems from the very nature ot tne 
various vafldi?y%tudies which^ provided the data which we were try- 
ing to predict. Of course, there was^ no 'control on the design of 
tui ./fliifittv studies which served as the sources of -the criterion 
test data' and nof only did the quality of the studies seem to vary 
™Lhat but thp criteria used in the different studies, and, all 
etullef haf Len cllr^ed out in the same fashion, it might have been 
I^'^iir to oredict the' outcome from the PAQ-based equations. 

Snforlunatlly Ihese (and perhaps other) problems are typical 
of those which involve the collection of already existing research 
Sata from a Jarie^y 6f organizations. .(The data obtained from^rats 
in 5 laboratorrce^tiirily can be obtained under much more neatly 
"controiled" Snditibns than obtaining human data from the real 
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world of work.) The riature of this project required the collec-- 
tion of test validity and/or normative data from various organ.i.^.a— 
tions, and it would be inevitable that the adequacy of the v^irious 
test validation studies used would , have been' variable, and that the. 
criteria used in them would have varied. The collective effect ok 
some of these problems and of their effects On the. .summarizing and 
analysis of the data would be expected to reduce the strength, of 
the' actual relationships between the predictors and the criterion 
values, thus representing a conservative estimate of the predict- 
ability of the PAQ-based data. In the light of these constraints, 
the results seem "generally to give further ...support to the potential 
use of a structured job analysis procedure as the basis for estab- 
lishing personnel specifications for jobs. . 4. 

This particular report is presented as a , preliminary, report 
covering relevant data that were available. Efforts are being made 
to obtain, for additional, jobs , relevant-test data and PAQ analyses 
toward- the end of building up a larger sample for a subsequent.^ 
analysis within the next several months. In. connection with the 
analysis it is expected that" a more recently-developed set? of PAQ ^ 
■ job dimensions will be- used, specifically a set that would be based 
on a. larger and more representative sample of jobs. It is also ^ 
hoped that data for an expanded sample of jobs would include test 
validity and/or normative data for a wider range of , commercially, 
available tests, thus providing a basis for greater •generalisation 
of PAQ-based job datm for' the estimation of personnel requirements 
of jobs using the basic job component validity model. 



List of Reftfences . 

Balina^ J.J. The development of proGesses for indirect or synthetic validity 

(A symposium) 1 1. the concapt. of synthetic validity. Personnel Psychology ;' 
1959, 12, 395-396. . . 

porcus, R-M. and JonesVM.H* Handbook off Eniployee Selection y New Yorki McGraw- i 
Hill, 1950 : ■ ' . " - ' ; 

Jeanneret, P'iR. and McCormick, E.J* The job dimensions of "worter-oriented" 
job variables and of their attribute profiles as based on data from the 
Position toalysis Questionnaire. Occupational Research Center, Purdue 
University ^ June 1969 , (Prepared for the Office of Naval Research under 
^contract NONR - 1100 (28) , te^ort No. 20 

Lawshe, C.H* P^at cam industrial psydhology do for small business (A symposium) s 
2, Bnployee selection . ^ Personnel Psychology , 1952, 5^, 31-34* 

McCormick, E.J* The application of structured job analysis .^infoCTnation based 
on the Position Analysis- Questionnaire (PAQ) . Occupational Research 
Center, Purdue University , October 1974. (Prepared for the Office of . 
" ^ Naval ^Resarach under contract N00014-67-A-0226-0016 , Report No. 9*) 

Marquardt , ,^L,D. and McCormick, E* J. T^ie job dimensions underlying the job 
1 elements of the Position Analysis Ouestionnaire (PAQ) (Form B) Occupa-- 
tioijal Research Center, Purdue University , ;i974a. (Prepared for the . 
Office of Naval Research under contract N00014-67=A-0226-0016 , Report No. 4.) 

' ■ ^ ■ - ■ ■ ■ . \/ ^ 

Marquardt, L.b. and McCormick, E*J. The utility of job dimensions based on y 
form B of the Position toalysis Questionnaire (PAQ) in a job component ^ 
validation model* Occupational Research Center, Purdue University, 
i974b. (Prepared for the bffice of yaval Research under contract 
N00014=67^A-0226^0016, teport No* 5.) 

McCormick, E*J^, Mecham, R*C. and Jeannera^, P*R* Technical Manual for the 

' Position Analysis Questionnaire (PAQ) . West Lafayette, ^ Indiana^ University 
Bookstore, 1972* , . 

Mecham, R*C* and McCormick , E * J * The use of data^based on the Position Analysis* 
Questionnaire in developing synthetically-derived attribute requirements of 
jobs* Occupational Research Center ,^ Purdue University, 1969* (Prepared 
^ for^he Office of Naval Research under cpntract NONR-1100 (28), Report No* 4, 

U.S, Dapaw-'ment of Labor, Manpower Admini Station. Manual for the USTES General 
' Aptitude Test Battery *. . Section III f Development * Washington, DC, 1970. ^ 

U,S, Employment Service, pictionary of Occupational Titles * Vol . II ■ Occupational 
Classification ^ (3rd ed.) Washington, DCs U.S * Government Printing Office, 
1965, ~ - / ■ ■ ' ' . ' ■ 



25 



APPENDIX A 
Jobs Included in the Sample 

Jobs for which data were obtained directly from/various 
organizations I . , 

Job Title ^. r^" D.O.T. Number * 

Feeder Catcher 619*886 
General Factory Worker ' ' 899.381 

Roll Catcher ' 643.886 

Blotter . . 651.782 

Utility 922.883 
Stripper 749.8 87 

Apprentice Steelworker ** 
Bundler Stacker 221.388 
Oil Maintenance and Operator (trainee) ** 
Oil Assistant Operator 
Business Forms Pressmen 
Special Officer (Dept. Store) 
Saleclerk 

Pipefitter (trainee) 
Oil Refinery Process Trainee 
Computer Operator 
■General Clerk 
Billing Checker 
Requisition Handler 
Order .Filler 
Stock Assistant 
Typist 

Shipping Packer 
Shipping Checker 
Sorter 

Returned Goods Receiving Clerk 
Material Handler 
Induction Clerk 
Bookkeeper 
Keypunch Operator 
Calculating Machine Operator 
Record Clerk 
Correspondence Clerk 
Timekeeper 
Buyer ' s- Assistant 
Receiver Checker ' 
Secondary Receiver 
Industrial Truck Driver 

There v/ere also several clusters of jobs which have no particular 
title since they are actually composites of several jobs included 
in this sample. These clusters resulted from various research ef= 
forts on the part of the organizations, and were treated as in--^ 
dividtial jobs, 

^ *Code number from the Dictionary of Occupational Titles 
**The title as furnished by the company covers several related 
D.O«Ti classifications • 



542.280 
** 

290.478 
862.381 
452.280 
213.282 
209.388 
209.688 
221.388 
922.887 
223.387 
112.800 
227.587 
209.688 
222.687 
222.387 
929.887 
161.688 
210.388 
213,582 
216.488 
222.587 
204.288 
219.388 
223.368 
222. 6J7 
222.587 
905.883 
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Jobs for which data were obtained from journals, test manuals 
etc*: ' ' 

Job Title ."' D.O.T. Number* 



824.281 

209.388 ■ 



Electrician 
Bank Clerk 

Bookkeeping Machine Operator ^ii'^So 
Secretary 
Meter Reader 

Mail Clerk • ■ 

Shipping Clerk 
File Clerk 
Gas Serviceman 
Yard Clerk 
Industrial Engineer. 
..Electrical Technician 
Packer 
Machinist 

Subscription Clerk 9/ia ^cr 

Telephone Service Representative rio tri 

Aircraft Inspector Inlflft^ 
Truck Driver III nil 

Telephone Operator ^^?'r78 
Plant worker itt'tai 
Assembler -jiq ina 

coding Clerks cti i,i 

Aircraft Manufacturing Foremen bZJ..iJJ, 

Draftsman 



201.368 
239.588 
231.588 
222.587 
206.388 
637.281 
910.388 
012.188 
003.131 
920. 887 
638.281 
, 209 .488 



Accountant _ " 



005.281 
160.188 



insurance Sales Representative ' N ^50.258 

Programmer oAi'-t^a 

Claims Adjustor • ' ■ lig 488 

Accounting Clerk 5i^'382 

computer Operator . 207:388 

Stenographer , ^gg 

Receptionist ' 212'.368 

Bank Teller , . qio tar 

Reservations Clerk 912-368 

^yP^f^ ' . ■ 105.000 

Keypunch Machine Operator fit 'oil 
Policeman ^, 

* Code number from the Dictionary nf Occupational Titles 
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APPENDIX B 



Tests Used to Measure The Various Constructs 



Intelligence 

Wonderliq 21 cases 

Otis Test of Mental Ability ^ 3 cases 
Test of Learning Ability - 6 cases 
. SRA Adaptability Test - 2 cases 
Special (or in house) - 7 cases 

Verbal Aptitude 

SRA==Verbal - 7 cases 
PTI-Verbal ^ 7 cases 
SET-Verhal - 2 cases 
EAS^Verbal /- 13 cases 

Numerical Aptitude 

SRA-Mumerical (or Arithmetic Form) - 3 cases 
EAS^Numerical - 14 cases 
SET=Numerical - ^ 3 cases 
.. PTi-^Numerical - 1 cases 

Arithmetic Fundamentals - 3 cases 
Arithmetic Reasoning Test 1 case 
FIT^Arithmetic - 4 cases 
I Special (or in*house) 31. cases 

Spatial Aptitude 

' Minnesota Paper Forms Board ^ 2 cases 

i EAS-^Spatial - 5 cases 

! SRA-Assembly - 1 case , 

i Special (or in-^house) 30 cases 

Clerical Perception 

; SET-Clerical - 3 cases 

Special (or in-house) - 24 cases 
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APPENDIX C 
DevelopinQnt of Norms 



In order to carry out this study, it was necesaary to have 
all test data in some connnbn framework ^ that Is^ all test 
scores had to be expressed in terms of some eonmon scale. In 
order to do this, it would have been desirable^ to have data on . 
all the tests used in this study for the same population. This 
was alearly.not possible. Therefore^ some alternative strategy 
had to be formulated:,'' Since the tests of the General Aptitude- 
Tests Battery (GATB) of the United States Training and' Employ- 
ment Service were used as the criterion tests in other studies 
involving the PAQ in a job component validity model .(Mecham and 
Mccormick, 1969; Ma'rquardt and .McCormick , 1973) , it was decided 
that the norms for the GATB would be utilized as the basic frame^ 
work. The GATB tests are reported , in standard score form, with 
a mean of 100 and a standard deviation of 20^ these values being 
derived from test data on over 20^000 people collected by the 
^United States Training and Employment Service. 

The major problem, however, ^ was with the other tests that 
would be used in this study. Although it was not very difficult 
to assign any given test to an appropriate construct^ since the 
"constructs" utilized in this study were of such a broad nature 
(for example, it seems obvious that the Short Employment Test of 
Verbal Ability should be classified as a "Verbal Aptitude" test) , 
there is no body of data indicating the norms for these tests on a 
"general"population • Not only were the different tests used with 
different. populations, but for most tests there was not a single 
population that could be considered as a "general working popula-^ 
tion" for which norms were supplied. Since that ^ is basically 
what the GATB norms are based .on, it seemed desirable to have all 
other norms based on the same kind of population. 

The first problem, then, was to construct general norms for 
each of the different tests- ^ The seriousness of this prbblem 
A^aried for the different tests.. For example, for the Employee 
Aptitude Survey Tests, norms are provided =for a general working 
pppulatioji, and so these norms were used. For the Otis Test of 
Mental Ability, norms are provided for a general population, but 
these norms ,are given separately for males and females, and so 
they Had to be consolidated into a single set of norms. For the 
various tests' published^ by the Psychological Corporation -(viz, 
'The Short Employmerit Tests and the Personnel Tests for Industry), 
no general norms are provided.'. Rather, there are norms provided 
for separate occupational groups. . In general, the number of 
occupational groups for which such norms were provided were too 
numerous to be reasonably consolidated, so these norms were 
sampled. That is, a number of groups which seemed to. have 
"high" norms were combined with. a number of groups which seemed 
to have "low" norms, yielding a single set of norms which it was 
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felt provided a fair representation. ,of a norm for a "general" 

population. ^ 

Thus, for each test a single set of norms was constri^ed^ 
recognizing that the populations deemed "general" were different 
for each test. Again, there was no way to avoid this lack of a 
single "general" population for whom scores were known on all 
tests used. The test scores which were obtained from either the 
organizations' involved in the study, or any of the other methods 
discussed earlier, for each job, were then standardized on the 
general norms established for the corresponding GATE test. This 
resulted in a standard score for each test, for the incumbents on 
each job, reflecting where mean test for incumbents on that job 
fell on the particular construct relative to all other jobs. 
• T'he problem still remained of converting all these scores to 
some common metric. As was pointed out above, the GATE. norms were 
to be used as the framework for this common metric. Since one set 
of standard scores is always directly convertible to any other set 
of standard scores, the scores for each job were then converted ■ 
into standard scores with a mean of 100 and a standard deviation ot 
20 For examnle, if the mean score for a sample of plumbers hap- 
tDened to be 40 on the' Revised Minnesota Paper Forms Board , and 
this was found to be equal to a standard score of .50 based on the 
general'population norms constructed for that test, conversion to_ 
the GATE norms was simply a matter of multiplying the GATE standard 
deviation of 20 by .5, (10), and adding this to the GATE norm mean 
of 100 (since the standard score was positive), yielding a GATB 
standard score of 110. This process was , repeated for each Dob and 
each test so that the final product was a continuum for each 
"construct," with the ' mean scores of incumbents on th-e various Dobs 
being assigned positions on this continuum based on standard scores 
with a mean of 100 and a standard deviation of 20. It was these 
converted scores that were used as the criterion values of mean . 
Stlcores" used in this study. Thus, since the PAQ_ based predic= 
tions are in terms of the GATE tests, and the criterion tests 
scopes for the various jobs were in GATE standard score terms, mean 
test score values for the two could be directly compared for any 
job! I somewhat similar procedure was used in deriving the predicted 
cut-off score criterion values, which were, one standard deviation . 
below the mean of the scores of incumbents on the individual Dobs. 
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